Supervised Consensus Scoring Methods for Structure-Based Drug Design
نویسندگان
چکیده
Molecular docking is widely used to discovery novel ligands in structure-based drug design (SBDD) [1]. Over the past 15 years, various docking programs have been developed. Generally, flexible docking programs, such as DOCK, AutoDock, FlexX, GOLD and Glide, have the ability to predict protein-ligand complex structures with reasonable accuracy and speed. These docking programs are aimed at predicting the binding mode of a ligand and discriminating between active and inactive compounds. They involve two computational steps: docking and scoring. In docking step, many ligand conformations, called decoys, are generated. Next, a scoring function is used to evaluate the protein-ligand affinity. The final docked conformations are selected according to their scores. Accordingly, scoring functions play important roles in SBDD [2, 3]. Although considerable efforts have been devoted to design scoring functions to describe protein-ligand interactions, it is pointed out that the major weakness of docking programs lies in scoring functions [2]. There are three groups in scoring functions: force field based methods, empirical scoring functions and knowledge-based potentials. Each scoring functions has specific advantages and disadvantages and there is no standard method. Recently, it is reported that consensus scoring method can improve the performance by compensating for the deficiencies of each scoring functions [2, 3]. However, conventional consensus scoring, such as rank-by-rank, rank-by-number, average rank, and linear combination of multiple scoring functions, does not address the problem of designing accurate scoring function essentially and improve the performance enough. In order to cope with the problem, we propose the supervised consensus scoring method (SCS) to incorporate free energy landscape between the protein and the ligand via the decoy structures using supervised learning. We show that SCS outperforms conventional consensus scoring in predicting binding mode and discuss why SCS improves the performance.
منابع مشابه
Scoring functions and their evaluation methods for protein-ligand docking: recent advances and future directions.
The scoring function is one of the most important components in structure-based drug design. Despite considerable success, accurate and rapid prediction of protein-ligand interactions is still a challenge in molecular docking. In this perspective, we have reviewed three basic types of scoring functions (force-field, empirical, and knowledge-based) and the consensus scoring technique that are us...
متن کاملModeling Transcription Factor Binding Sites with Supervised Learning
We present a supervised learning approach to transcription factor binding site modeling for four distinct species. Using the consensus scoring method, we look at binding sites of unequal length and the alignment strategy associated with these binding sites. Pairwise scoring and information content were added to the consensus scoring to further increase accuracy of transcription factor binding s...
متن کاملReducing Annotation Efforts in Supervised Short Answer Scoring
Automated short answer scoring is increasingly used to give students timely feedback about their learning progress. Building scoring models comes with high costs, as stateof-the-art methods using supervised learning require large amounts of hand-annotated data. We analyze the potential of recently proposed methods for semi-supervised learning based on clustering. We find that all examined metho...
متن کاملSSHMT: Semi-supervised Hierarchical Merge Tree for Electron Microscopy Image Segmentation
Region-based methods have proven necessary for improving segmentation accuracy of neuronal structures in electron microscopy (EM) images. Most region-based segmentation methods use a scoring function to determine region merging. Such functions are usually learned with supervised algorithms that demand considerable ground truth data, which are costly to collect. We propose a semi-supervised appr...
متن کاملFunction-specific virtual screening for GPCR ligands using a combined scoring method
The ability of scoring functions to correctly select and rank docking poses of small molecules in protein binding sites is highly target dependent, which presents a challenge for structure-based drug discovery. Here we describe a virtual screening method that combines an energy-based docking scoring function with a molecular interaction fingerprint (IFP) to identify new ligands based on G prote...
متن کامل